Large language models (LLMs) have exploded in popularity in the past few years and have achieved undeniably impressive results on benchmarks as varied as question answering and text summarization. We provide a simple new prompting strategy that leads to yet another supposedly "super-human" result, this time outperforming humans at common sense ethical reasoning (as measured by accuracy on a subset of the ETHICS dataset). Unfortunately, we find that relying on average performance to judge capabilities can be highly misleading. LLM errors differ systematically from human errors in ways that make it easy to craft adversarial examples, or even perturb existing examples to flip the output label. We also observe signs of inverse scaling with mode...
The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled ...
Language models (LMs) can be directed to perform target tasks by using labeled examples or natural l...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
AI systems are becoming increasingly intertwined with human life. In order to effectively collaborat...
In the present study, we investigate and compare reasoning in large language models (LLM) and humans...
Assessments of algorithmic bias in large language models (LLMs) are generally catered to uncovering ...
Prompts have been the center of progress in advancing language models' zero-shot and few-shot perfor...
In a recent letter, Dillion et. al (2023) make various suggestions regarding the idea of artificiall...
The development of highly fluent large language models (LLMs) has prompted increased interest in ass...
Abstract reasoning is a key ability for an intelligent system. Large language models achieve above-c...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...
As large language models (LLMs) have become more deeply integrated into various sectors, understandi...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
In recent years Artificial Intelligence (AI), especially deep learning, has proven to be a technolog...
Large language models (LLMs) have demonstrated impressive capabilities in natural language understan...
The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled ...
Language models (LMs) can be directed to perform target tasks by using labeled examples or natural l...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...
AI systems are becoming increasingly intertwined with human life. In order to effectively collaborat...
In the present study, we investigate and compare reasoning in large language models (LLM) and humans...
Assessments of algorithmic bias in large language models (LLMs) are generally catered to uncovering ...
Prompts have been the center of progress in advancing language models' zero-shot and few-shot perfor...
In a recent letter, Dillion et. al (2023) make various suggestions regarding the idea of artificiall...
The development of highly fluent large language models (LLMs) has prompted increased interest in ass...
Abstract reasoning is a key ability for an intelligent system. Large language models achieve above-c...
Large language models (LLMs) have achieved remarkable advancements in the field of natural language ...
As large language models (LLMs) have become more deeply integrated into various sectors, understandi...
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledge ...
In recent years Artificial Intelligence (AI), especially deep learning, has proven to be a technolog...
Large language models (LLMs) have demonstrated impressive capabilities in natural language understan...
The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled ...
Language models (LMs) can be directed to perform target tasks by using labeled examples or natural l...
Large Language Models (LLMs) are increasingly used for accessing information on the web. Their truth...